1. INTRODUCTION

Nowadays, digital networks are so developed that they have become an essential communication 
mechanism. They allow the transmission of all kinds of digital data: text, sound, and mainly images. The 
increased use of multimedia applications raises more and more problems concerning the preservation of the 
confidentiality and authenticity of these digital data, in particular images which must be protected from any 
falsification [1]. An appropriate solution to this problem is the use of fragile watermarking methods.
These methods are suitable for verifying integrity and owner authenticity. A fragile watermark is
distorted, and the hidden information is lost or modified, as soon as the host image undergoes a modification
[2]. Thereby, the loss of the watermark or its alteration will be taken as a proof that the data has been 
modified, while the recovery of the watermark is used to demonstrate the integrity of data. The watermarking 
algorithms found in the literature can be divided according to the domain of insertion into two categories [3]: 
watermarking in spatial domain and watermarking in spectral domain. 

In the spatial domain, the watermark is inserted by directly modifying the pixel values of the host 
image [4]. Several approaches have been developed in this domain. Among them is the
conventional least significant bit (LSB) method [5], which is used because of its simplicity and its small effect
on the image [6]: the LSB of the host image is replaced by the most significant bit (MSB) of the watermark,
and these changes cannot be perceived by the human visual system. In the approaches related to the
spectral domain, by contrast, the embedding is carried out after transforming the host image using one of the
following transforms or a combination of them [7], [8]: discrete Fourier transform (DFT), discrete
cosine transform (DCT), and discrete wavelet transform (DWT).

Spectral domain-based watermarking approaches are known for their good robustness against 
multiple attacks [9], [10]. Many research works have been carried out. Al-ardhi et al. [11] used a
watermarking scheme based on the linear cellular automata transform (LCAT) algorithm operating in the
spectral domain to protect vector maps under copyright; the proposed scheme is robust against
geometric attacks, namely translation, scaling, and rotation. To reach the same goal, Neyman et al. [12]
proposed a watermarking algorithm based on the fast Fourier transform (FFT) to embed a copyright marker
into a vector map.

Spatial domain watermarking techniques, by contrast, are computationally simpler and offer
good imperceptibility, but they provide less protection against attacks, in particular geometric attacks such as
scaling, printing and scanning, ratio changes, and cropping. Several algorithms have been presented
on this matter. Many watermarking methods developed in the past employed the singular value
decomposition (SVD) [13], [14]. Shehab et al. [15] proposed an SVD-based fragile watermarking scheme
using a grouped block method that offers more security and makes it possible to locate the attacked areas
inside different medical images. Bal et al. [16] presented an LSB watermarking procedure based on bit pair
similarity; to make the watermark more secure, the authors used symmetric key cryptography. Kumar and
Singh [17] proposed a novel scheme based on encrypting the watermark image with the Hill
cipher before the embedding procedure; the robustness of this method was demonstrated by testing
multiple attacks. Zhang et al. [18] proposed a robust image watermarking scheme based on mathematical
features of SVD in the spatial domain and Arnold's transformation to encrypt the watermark. To secure
medical images in digital imaging and communications in medicine (DICOM) format, Ayu et al. [19]
proposed a digital watermarking technique based on encrypting the DICOM confidentiality tags with
the AES256 encryption algorithm and inserting them into the 2nd LSB of the medical image; the proposed
technique maintains image quality and resists several malicious attacks. In order to provide a full security
system for medical information (imaging and report) in terms of confidentiality, authentication, and integrity,
Boussif et al. [20] proposed an approach which combines a semi-reversible build watermarking method robust
to JPEG compression, build fragile watermarking, and a stream cipher symmetric encryption algorithm. Joshi
et al. [21] presented a detailed study of the LSB method using different secret message sizes to analyze its
impact on the PSNR and MSE metrics. Rinki et al. [22] proposed a new LSB digital watermarking scheme
based on selecting a non-consecutive matrix of pixels and replacing the last three bits of each RGB
component of the host image with the bits of the watermark image; the authors claimed that this scheme
enhances robustness, imperceptibility, and security. Sowmya et al. [23] proposed an algorithm based on
embedding a watermark into another watermark in addition to encrypting the main watermark using the A5/1
encryption algorithm; the encrypted watermark is then inserted into the two LSBs of the green
and blue components of the host image. This scheme increased embedding capacity and safety.

In order to combine the good features of the two previous categories, hybrid domain-based 
watermarking approaches have been investigated to obtain both robustness and imperceptibility. Kumar et al. [24]
proposed a method combining embedding in the wavelet domain and the spatial domain to improve authenticity,
security, and copyright protection. In addition to being classified according to the domain of
insertion, watermarking methods may also be classified as strong watermarking [14] or fragile
watermarking [15]. The strong watermarking method allows extraction of hidden watermarks from
watermarked images, even after image processing such as image compression and filtering; thus, it can be
exploited to verify copyright. The fragile watermarking technique produces a very sensitive watermark
designed to detect every possible change in the marked image; thus, it can accurately detect the tampered area
and ensure integrity verification and authenticity. As far as we know, Walton [25] was the first to propose a
complete fragile watermarking scheme for image tampering detection. The author chose to insert
"checksum" control values into the least significant bits of the pixels. This technique offers a high
probability of tamper detection, but it is vulnerable to block swap attacks.

This paper proposes a new fragile watermarking scheme based on LSB substitution, an optimal block
selection strategy, and Huffman encoding. Our proposed method can be classified as a fragile hybrid
domain-based watermarking technique. The experimental results show that Huffman encoding and optimal block
selection offer good security and imperceptibility. The remainder of this paper is organized as
follows. Section 2 presents the proposed method in detail. Section 3 presents the experimental results and
discusses them. Conclusions are given in section 4.
2. METHOD

In this section, we propose a novel scheme for digital image watermarking that combines
Huffman encoding with an optimal embedding strategy to improve both security and imperceptibility. Security
of the watermark is maintained since it cannot be extracted without knowing the decoding rules and the Huffman
table. Furthermore, specific blocks are selected for embedding according to the maximum entropy value.


2.1. Huffman coding 

Huffman coding is a lossless data compression algorithm developed by Huffman [26] in 1952. It
uses a variable-length code to represent each source symbol (pixels of the watermark in our case). The most
frequent symbols are encoded with the shortest binary codes, so the space
used to encode them is minimal, which increases the compression rate. The purpose of the Huffman
algorithm is to reduce the number of bits used for encoding frequent symbols and to increase this number
for infrequent ones [27]. As shown in Figure 1, using Huffman encoding and the Huffman table, the
watermark (logo image) is transformed into a bitstream of 0 and 1 values that can be represented as an image
to produce a scrambled watermark.

The scrambled watermark and the secret key (Huffman encoding and the Huffman table) are used to
restore the pixel values of the original watermark, as shown in Figure 2; the restored watermark is then used to
verify the suspected digital image. The encryption step using Huffman encoding has two advantages: the first
is securing the watermark and the second is compressing the watermark before embedding to reduce
the distortion. Furthermore, embedding in specific blocks of the image may improve the imperceptibility and the
quality of the watermarked image.


[Figure 1 diagram: the watermark passes through Huffman encoding to give the Huffman code of the watermark, 101100010101000001100...]

Figure 1. Huffman encoding


[Figure 2 diagram: the Huffman code of the watermark, 101100010101000001100..., passes through Huffman decoding to give the watermark]

Figure 2. Huffman decoding
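
The encoding and decoding steps of Figures 1 and 2 can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the toy pixel list, and the choice of which branch receives "0" or "1" are all assumptions. The table returned by `build_huffman_table` plays the role of the Huffman table that the paper treats as the secret key.

```python
import heapq
from collections import Counter

def build_huffman_table(symbols):
    """Return {symbol: bitstring} for a sequence of symbols (hypothetical helper)."""
    freq = Counter(symbols)
    if len(freq) == 1:                       # degenerate case: one distinct symbol
        return {next(iter(freq)): "0"}
    # Heap entries: (frequency, unique tie breaker, {symbol: partial code}).
    heap = [(f, i, {s: ""}) for i, (s, f) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    counter = len(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # two least frequent subtrees
        f2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (f1 + f2, counter, merged))
        counter += 1
    return heap[0][2]

def huffman_encode(symbols, table):
    """Concatenate the code of every symbol into one bitstream."""
    return "".join(table[s] for s in symbols)

def huffman_decode(bits, table):
    """Invert the table and read the prefix-free bitstream symbol by symbol."""
    inverse = {code: sym for sym, code in table.items()}
    out, current = [], ""
    for b in bits:
        current += b
        if current in inverse:
            out.append(inverse[current])
            current = ""
    return out

pixels = [255, 255, 0, 255, 128, 255, 0, 255]   # toy flattened watermark
table = build_huffman_table(pixels)
bits = huffman_encode(pixels, table)
restored = huffman_decode(bits, table)
```

In this toy case the most frequent value (255) receives a one-bit code, so the eight pixels need 11 bits instead of 64, and decoding with the same table restores them exactly.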


2.2. Optimal embedding strategy 

Considering the characteristics of the human visual system, we investigated embedding the
scrambled watermark information in specific blocks of the host image according to the entropy value. The
entropy measures the degree of uncertainty or randomness in the image [28], [29].
It is given as:


$$E(x) = -\sum_{x} p(x) \log p(x) \qquad (1)$$


Here p(x) is the probability of occurrence of the intensity value x in the image. Human eyes are less sensitive to high-entropy
regions, so optimal imperceptibility is obtained when embedding is performed in the block with the maximum entropy
value.
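
Block selection by maximum entropy, following (1), might look like the sketch below. The base-2 logarithm, the block size, the function names, and the toy image are assumptions made for illustration; the paper does not state them.

```python
import math
from collections import Counter

def entropy(values):
    """Shannon entropy of a flat list of intensity values, as in (1), using log base 2."""
    n = len(values)
    counts = Counter(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def max_entropy_block(image, block_size):
    """Return ((row, col), entropy) of the nonoverlapping block with maximum entropy."""
    best_pos, best_e = None, -1.0
    rows, cols = len(image), len(image[0])
    for r in range(0, rows - block_size + 1, block_size):
        for c in range(0, cols - block_size + 1, block_size):
            block = [image[r + i][c + j]
                     for i in range(block_size) for j in range(block_size)]
            e = entropy(block)
            if e > best_e:                   # keep the first block reaching the maximum
                best_pos, best_e = (r, c), e
    return best_pos, best_e

# Toy 4x4 image: three flat blocks and one block with four distinct values.
image = [
    [10, 10, 10, 10],
    [10, 10, 10, 10],
    [10, 10, 0, 64],
    [10, 10, 128, 255],
]
pos, e = max_entropy_block(image, 2)
```

Here the flat blocks have zero entropy, while the lower-right block holds four equally likely values and therefore has an entropy of 2 bits, so it is the one selected.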

The overall process of the proposed watermarking scheme is presented in Figure 3. First, the host
image (standard Lena image) is divided into nonoverlapping blocks to select the most suitable one for
embedding according to the maximum entropy value. Secondly, the scrambled watermark is embedded in the
first LSB of the host image block, then in the second and third LSBs as needed, depending on the
length of the bitstream. Finally, the watermark extraction procedure is the direct reversal of the watermark
embedding procedure: the scrambled watermark image is recovered, then, using the Huffman decoding rules,
we extract the original watermark image.


[Figure 3 diagram: the watermark is encrypted, the encrypted watermark is embedded in the host image, and the watermark is then extracted]

Figure 3. Watermark encryption, embedding, and extraction flow diagram
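
One way to read the embedding and extraction steps is sketched below: the bitstream fills the first LSB of every pixel in the selected block, then the second, then the third. The plane-by-plane fill order, the function names, and the assumption that the extractor already knows the bitstream length are illustrative choices, not details given in the paper.

```python
def embed_bits(block_pixels, bits, max_planes=3):
    """Write bits into bit plane 0 of every pixel, then plane 1, then plane 2."""
    capacity = len(block_pixels) * max_planes
    if len(bits) > capacity:
        raise ValueError("bitstream longer than block capacity")
    out = list(block_pixels)
    for k, bit in enumerate(bits):
        plane, idx = divmod(k, len(out))     # plane 0 is filled completely first
        mask = 1 << plane
        out[idx] = (out[idx] & ~mask) | (int(bit) << plane)
    return out

def extract_bits(block_pixels, n_bits):
    """Read n_bits back in the same plane-by-plane order used for embedding."""
    bits = []
    for k in range(n_bits):
        plane, idx = divmod(k, len(block_pixels))
        bits.append(str((block_pixels[idx] >> plane) & 1))
    return "".join(bits)

block = [200, 13, 77, 254]      # toy flattened block of 8-bit pixels
payload = "101101"              # 6 bits: 4 go to plane 0, 2 go to plane 1
marked = embed_bits(block, payload)
recovered = extract_bits(marked, len(payload))
```

With this toy block the marked pixels are `[201, 14, 77, 255]`, no pixel changes by more than 1, and the payload is recovered exactly as long as the marked pixels are untouched.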


3. RESULTS AND DISCUSSION 

In order to evaluate the effectiveness of the proposed scheme and measure the effect of combining
Huffman encoding with the optimal embedding strategy, the peak signal-to-noise ratio (PSNR), the structural similarity
index (SSIM), and the normalized correlation coefficient (NC) of the extracted watermark with respect to the
original watermark were calculated. Imperceptibility is evaluated through the PSNR, which measures the degree
of similarity between the original image and the watermarked image. The PSNR is defined as (2).


$$\mathrm{PSNR} = 10 \log_{10}\left(\frac{255^2}{\mathrm{MSE}}\right) \qquad (2)$$


The mean squared error (MSE) is obtained using (3):


$$\mathrm{MSE} = \frac{\sum_{m,n} \left[ I_1(m,n) - I_2(m,n) \right]^2}{M \times N} \qquad (3)$$


where M and N are the number of rows and columns in the input images, respectively, I_1(m, n) is the
original image, and I_2(m, n) is the watermarked image.
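
A minimal sketch of (2) and (3) for grayscale images stored as nested lists is given below. The peak value of 255 assumes 8-bit pixels, and the function names and toy images are illustrative.

```python
import math

def mse(img1, img2):
    """Mean squared error between two images of the same M x N size, as in (3)."""
    rows, cols = len(img1), len(img1[0])
    total = sum((img1[m][n] - img2[m][n]) ** 2
                for m in range(rows) for n in range(cols))
    return total / (rows * cols)

def psnr(img1, img2, peak=255):
    """PSNR in dB, as in (2); infinite when the two images are identical."""
    err = mse(img1, img2)
    if err == 0:
        return math.inf
    return 10 * math.log10(peak ** 2 / err)

original = [[100, 100], [100, 100]]
watermarked = [[101, 100], [100, 99]]    # two pixels changed by 1
error = mse(original, watermarked)
quality = psnr(original, watermarked)
```

Two unit changes over four pixels give an MSE of 0.5 and a PSNR of about 51.1 dB; in a full-size image where only one block is modified, the MSE is far smaller and the PSNR correspondingly higher.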

For RGB color images, the SSIM is defined as (4):


$$\mathrm{SSIM}(i, i') = l(i, i') \, c(i, i') \, s(i, i') \qquad (4)$$


where i is the original image and i' is the distorted image. l(i, i'), c(i, i'), and s(i, i') are, respectively, the
functions that compare the luminance, contrast, and structure of image i and image i' [30].
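
To make the three factors in (4) concrete, the sketch below evaluates them once over a whole flattened image. This is a simplification: SSIM is normally computed over local windows and averaged, and the constants `k1 = 0.01`, `k2 = 0.03`, and `c3 = c2 / 2` are the commonly used defaults rather than values stated in this paper. The function name and sample data are illustrative.

```python
def ssim_global(x, y, peak=255, k1=0.01, k2=0.03):
    """Single-window SSIM = l * c * s for two flat, equal-length pixel lists."""
    n = len(x)
    mu_x, mu_y = sum(x) / n, sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / (n - 1)
    var_y = sum((b - mu_y) ** 2 for b in y) / (n - 1)
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / (n - 1)
    sd_x, sd_y = var_x ** 0.5, var_y ** 0.5
    c1, c2 = (k1 * peak) ** 2, (k2 * peak) ** 2
    c3 = c2 / 2
    lum = (2 * mu_x * mu_y + c1) / (mu_x ** 2 + mu_y ** 2 + c1)   # luminance term l
    con = (2 * sd_x * sd_y + c2) / (var_x + var_y + c2)           # contrast term c
    struct = (cov + c3) / (sd_x * sd_y + c3)                      # structure term s
    return lum * con * struct

host = [52, 55, 61, 59, 79, 61, 76, 61]
marked = [53, 55, 60, 59, 79, 60, 77, 61]    # a few pixels changed by 1
same = ssim_global(host, host)
close = ssim_global(host, marked)
```

Identical inputs give a value of 1, and the slightly modified copy gives a value just below 1, which matches the behaviour expected from LSB-level changes.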

The idea behind SSIM is to measure the structural similarity between the two images, rather than a
pixel-to-pixel difference as the PSNR does [31], [32]. The SSIM value should ideally be close to unity. The
robustness of the proposed algorithm is analyzed using the normalized cross correlation (NC), a metric that
evaluates the degree of similarity between the original watermark and the extracted watermark. The equation
to compute NC is given in (5):


$$\mathrm{NC} = \frac{\sum_{i} \sum_{j} OW(i,j) \, EW(i,j)}{\sum_{i} \sum_{j} OW(i,j) \, OW(i,j)} \qquad (5)$$


OW and EW are, respectively, the original watermark and the extracted watermark. 
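
Equation (5) translates directly into code. The sketch below assumes the two watermarks are same-size nested lists and that the original watermark is not all zeros (otherwise the denominator vanishes); the function name and the toy binary watermarks are illustrative.

```python
def normalized_correlation(original, extracted):
    """NC as in (5): sum(OW * EW) / sum(OW * OW) over all pixel positions."""
    num = sum(o * e
              for row_o, row_e in zip(original, extracted)
              for o, e in zip(row_o, row_e))
    den = sum(o * o for row in original for o in row)
    return num / den

ow = [[1, 0, 1], [0, 1, 1]]              # toy original watermark
ew_intact = [[1, 0, 1], [0, 1, 1]]       # extracted without any modification
ew_damaged = [[1, 0, 0], [0, 0, 1]]      # two watermark pixels lost
nc_intact = normalized_correlation(ow, ew_intact)
nc_damaged = normalized_correlation(ow, ew_damaged)
```

The intact extraction gives NC = 1.0, while losing two of the four nonzero watermark pixels halves it to 0.5.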

We start our experiments by testing the proposed scheme using the 512x512 standard Lena
image as the host image and the 64x64 logo, thorax, leaf, and floor images, respectively, as watermarks
(Figure 4). Table 1 shows good values of PSNR, SSIM, and NC; they reach 63.12 dB, 0.9999,
and 0.998, respectively, for the embedding of the thorax image. This is due to the optimal embedding strategy,
which selects the block with the maximum entropy value; human eyes are less sensitive to high-entropy regions.
Furthermore, transforming the watermark into a bitstream of 0 and 1 values and inserting them in the LSBs of the
selected block greatly reduces the distortion of the host image. Embedding in the first LSB can modify the
value of a pixel by at most 1. Depending on the length of the bitstream, if we need to continue embedding
in the second or the third LSB of the host image, pixel values can be modified by at most 3 and
7, respectively.
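
As a short worked check of these bounds (a sketch, assuming that only the k lowest bit planes of a pixel may be overwritten), the largest possible change is the sum of the weights of those planes:

```latex
\max |\Delta| = \sum_{b=0}^{k-1} 2^{b} = 2^{k} - 1,
\qquad k = 1 \Rightarrow 1, \quad k = 2 \Rightarrow 3, \quad k = 3 \Rightarrow 7
```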

Table 1. Values of the three metrics for embedding the four watermarks in the host image Lena
Watermark    PSNR (dB)    SSIM      NC
Logo         56.2         0.9993    0.975
Thorax       63.12        0.9999    0.998
Leaf         57.03        0.9995    0.937
Floor        58.24        0.9997    0.943


In order to assess the image distortion, or more specifically the modification of pixel values in the
host image, we calculate the image histogram. It is defined as the plot of the frequency of each intensity
value in the image. The image histogram can be written as (6):


$$H_i(k) = J_k, \quad k \in [0, 255] \qquad (6)$$

where i is an image of dimension MxN and J_k represents how many times the gray level k occurs.
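
The histogram in (6) can be computed by simple counting, as in the sketch below. The `histogram_distance` helper is a hypothetical addition for comparing two histograms numerically; the paper compares them visually. Pixel values are assumed to be integers in 0..255.

```python
def histogram(image, levels=256):
    """H(k) = number of pixels with gray level k, for k = 0 .. levels - 1."""
    counts = [0] * levels
    for row in image:
        for value in row:
            counts[value] += 1
    return counts

def histogram_distance(h1, h2):
    """Sum of absolute bin differences; 0 means the histograms are identical."""
    return sum(abs(a - b) for a, b in zip(h1, h2))

host = [[10, 10], [200, 201]]
marked = [[11, 10], [200, 200]]          # two pixels changed by 1
h_host = histogram(host)
h_marked = histogram(marked)
distance = histogram_distance(h_host, h_marked)
```

Each pixel that changes moves one count from its old bin to a neighbouring one, so the two changed pixels here produce a distance of 4, while the total pixel count stays the same.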


Three standard images (shown in Figure 4) were chosen from the USC-SIPI image database [33], and the
same watermarks as before are used for embedding to investigate the proposed scheme further.


[Figure 4 panels: host images Boat, House, and Airplane; watermark images Leaf, Floor, and Thorax]

Figure 4. Host images and watermark images


Table 2 shows high values of PSNR and SSIM, greater than 56 dB and 0.9993,
respectively, which indicates that the quality of the watermarked images is very good and that the degradation
caused by the embedding process is generally imperceptible. Furthermore, the values of NC, which measures
the correlation of the original watermark with the extracted watermark, are at least 0.937.
NC results are very close to 1 when no attack is applied and close to 0 when any perturbation is applied. This
means that the proposed algorithm is very fragile to any perturbation that can affect it.

Figure 5 shows the good imperceptibility of the images produced by the proposed method by
comparing the visual quality of the original images to the watermarked images and the original watermarks to the
extracted watermarks, using Lena/logo in Figure 5(a), boat/leaf in Figure 5(b), house/floor in Figure 5(c), and
airplane/thorax in Figure 5(d) as host images and watermarks, respectively. The high visual quality is due to the
small modification carried out during the embedding process; only a few pixels are modified, and by at most 7 if we
need to embed in the third LSB.


Table 2. PSNR, SSIM, and NC
Host image/watermark    PSNR (dB)    SSIM      NC
Boat/leaf               56.9         0.9997    0.937
House/floor             58.61        0.9999    0.943
Airplane/thorax         63.19        0.9999    0.998

The image histogram plots the frequency of each intensity value in the image. Figure 6 shows
that all histograms of the watermarked images are similar to those of the host images, using Lena in Figure 6(a), boat in
Figure 6(b), house in Figure 6(c), and airplane in Figure 6(d) as host images. This close similarity indicates that
the proposed method preserves the image quality without noticeable distortion.


[Figure 5 panels: original image, original mark, extracted mark (NC = 0.97477)]

Figure 5. Comparing visual quality of the original images to the watermarked images and the original
watermarks to the extracted watermarks using (a) Lena/logo, (b) boat/leaf, (c) house/floor,
and (d) airplane/thorax

Figure 6. Comparing histograms of the original host images to those of the watermarked images for
(a) Lena, (b) boat, (c) house, and (d) airplane

4. CONCLUSION

In this paper, we present a novel secure fragile watermarking method based on Huffman encoding
and an optimal embedding strategy. First, we encrypt the watermark image using Huffman encoding to produce
a scrambled watermark. Then the host image is divided into nonoverlapping blocks to select the most suitable
one for embedding according to the maximum entropy value. The scrambled watermark is embedded in the
LSBs of the selected block of the host image, using as many bit planes as the length of the bitstream requires. Since the proposed
method is a fragile watermarking method, the watermark information will be destroyed by any image
processing such as compression or filtering. The main goal of encrypting the watermark before embedding is
to increase safety and security.

In this scheme, the requirements are met by embedding encrypted watermarks in the higher-entropy
area of host images. The experimental results indicate that the proposed technique provides high
security and high imperceptibility. Future work will focus on video watermarking by
extending this method to the different frames of videos.